Article: Tests for Two Trees Using Likelihood Methods

Authors

  • Edward Susko
  • Oliver Pybus
Abstract

This article considers two similar likelihood-based test statistics for comparing two fixed trees, the Kishino-Hasegawa (KH) test statistic and the likelihood ratio (LR) statistic, as well as a number of different methods for determining thresholds to declare a significant result. An explanation is given for why the KH test, which uses the KH test statistic and normal theory thresholds, need not give correct type I error probabilities under the appropriate null hypothesis. Simulations show that the KH test tends to give much smaller type I error probabilities than expected. The article presents a computationally efficient normal-theory parametric bootstrap method for determining better KH test statistic thresholds. For the LR statistic, existing mixture of chi-squares results for determining thresholds are extended to cases in which a tree with two or three zero edge-lengths exhibits the two trees being compared. The resulting chi-bar test and use of the KH test statistic with normal bootstrap are shown through simulation to give good performance but are more difficult to implement than the KH test. Two conservative approaches are presented which require only log likelihoods and simple chi-square thresholds. While they did not perform as well as chi-bar and normal bootstrap methods in the simulations considered, they gave better performance than the KH test and have just as simple an implementation. As a by-product of parametric bootstrap considerations, an adjustment to the Swofford-Olsen-Waddell-Hillis (SOWH) test is proposed.
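As a rough illustration of the conventional KH test that the abstract critiques, the sketch below computes the KH test statistic from per-site log likelihoods of two fixed trees and compares it with a standard normal threshold. It is a minimal sketch, not the authors' implementation: the function name `kh_test`, the input lists, and the choice of a two-sided p-value are assumptions made for illustration. It does not implement the normal-theory parametric bootstrap or chi-bar thresholds proposed in the article.

```python
import math
from statistics import NormalDist


def kh_test(site_lnl_tree1, site_lnl_tree2):
    """Conventional KH statistic with a normal-theory p-value.

    Inputs are per-site log likelihoods under each of two fixed trees
    (hypothetical inputs; any likelihood software could supply them).
    Returns (delta, z, p), where delta is the total log-likelihood
    difference, z is delta divided by its estimated standard error,
    and p is a two-sided p-value from the standard normal distribution.
    """
    if len(site_lnl_tree1) != len(site_lnl_tree2):
        raise ValueError("both trees need one log likelihood per site")
    n = len(site_lnl_tree1)
    if n < 2:
        raise ValueError("at least two sites are required")

    # Site-wise log-likelihood differences and their sum.
    diffs = [a - b for a, b in zip(site_lnl_tree1, site_lnl_tree2)]
    delta = sum(diffs)

    # Sample variance of the site differences; the variance of the sum
    # is estimated as n times this value, treating sites as independent.
    mean = delta / n
    var = sum((d - mean) ** 2 for d in diffs) / (n - 1)
    if var == 0.0:
        raise ValueError("site differences have zero variance")

    z = delta / math.sqrt(n * var)
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return delta, z, p


if __name__ == "__main__":
    # Made-up per-site log likelihoods, for demonstration only.
    tree1 = [-1.0, -2.0, -3.0, -4.0]
    tree2 = [-1.5, -2.0, -2.5, -4.5]
    print(kh_test(tree1, tree2))
```

According to the abstract, thresholds obtained this way tend to give type I error probabilities well below the nominal level, which is the motivation for the alternative thresholds the article proposes.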

Similar resources

Article: Alignment Errors Strongly Impact Likelihood-Based Tests for Comparing Topologies

Estimating phylogenetic trees from sequence data is an extremely challenging and important statistical task. Within the maximum-likelihood paradigm, the best tree is a point estimate. To determine how strongly the data support such an evolutionary scenario, a hypothesis testing methodology is required. To this end, the Kishino–Hasegawa (KH) test was developed to determine whether one topology i...


Article: A Method of Alignment Masking for Refining the Phylogenetic Signal of Multiple Sequence Alignments

Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, aut...


Capability of Rapid Eye Satellite Imagery to Map the Distribution of Canopy Trees in Dashtebarm Forest Area of Fars Province

In this research, the capability of Rapid Eye satellite imagery for mapping the crown distribution of oak trees in Zagros forests was investigated in the Dashtebarm forest area of Kazeroun, Fars province. In this study, data quality was investigated geometrically and radiometrically and geometric correction of the images was done using a linear method and using precision ground control ...


Partitioned likelihood support and the evaluation of data set conflict.

In simultaneous analyses of multiple data partitions, the trees relevant when measuring support for a clade are the optimal tree, and the best tree lacking the clade (i.e., the most reasonable alternative). The parsimony-based method of partitioned branch support (PBS) "forces" each data set to arbitrate between the two relevant trees. This value is the amount each data set contributes to clade...


Fitting Tree Height Distributions in Natural Beech Forest Stands of Guilan (Case Study: Masal)

In this research, the modeling of beech tree height distributions in the natural forests of Masal, located in Guilan province, was investigated. Inventory was carried out using systematic random sampling with network dimensions of 150×200 m and a sample plot area of 0.1 ha. DBH and heights of 630 beech trees in 30 sample plots were measured. Beta, Gamma, Normal, Log-normal and Weibull prob...


Publication date: 2014